OHASD: The First On-Line Arabic Sentence Database Handwritten on Tablet PC
نویسندگان
چکیده
In this paper we present the first Arabic sentence dataset for on-line handwriting recognition written on tablet pc. The dataset is natural, simple and clear. Texts are sampled from daily newspapers. To collect naturally written handwriting, forms are dictated to writers. The current version of our dataset includes 154 paragraphs written by 48 writers. It contains more than 3800 words and more than 19,400 characters. Handwritten texts are mainly written by researchers from different research centers. In order to use this dataset in a recognition system word extraction is needed. In this paper a new word extraction technique based on the Arabic handwriting cursive nature is also presented. The technique is applied to this dataset and good results are obtained. The results can be considered as a bench mark for future research to be compared with. Keywords—Arabic, Handwriting recognition, on-line dataset.
منابع مشابه
Off-line Arabic Handwritten Recognition Using a Novel Hybrid HMM-DNN Model
In order to facilitate the entry of data into the computer and its digitalization, automatic recognition of printed texts and manuscripts is one of the considerable aid to many applications. Research on automatic document recognition started decades ago with the recognition of isolated digits and letters, and today, due to advancements in machine learning methods, efforts are being made to iden...
متن کاملOnline Database of Quranic Handwritten Words
In this paper, an online Arabic handwritten words database is presented to be the first online Quranic handwritten words dataset. The dataset was collected naturally using Acer computer tablet 1.5 GHz core i3 by writing the words on a smooth touch screen using a special pen stylus. Here, a platform interface was designed to collect the handwritten words using Matlab environment. Handwritten wor...
متن کاملON - LINE UNCONSTRAINED HANDWRITINGRECOGNITIONBASED ON PROBABILISTIC TECHNIQUESHomayoon
This paper discusses a probabilistic on-line handwriting recognition scheme, based on Hidden Markov Models (HMM's), and its implementation for recognizing handwritten words captured from a tablet. Statistical methods, such as HMM's have been used successfully for speech recognition. These methods have recently been applied to the problem of handwriting recognition as well. This paper, discusses...
متن کاملIsolated Persian/Arabic handwriting characters: Derivative projection profile features, implemented on GPUs
For many years, researchers have studied high accuracy methods for recognizing the handwriting and achieved many significant improvements. However, an issue that has rarely been studied is the speed of these methods. Considering the computer hardware limitations, it is necessary for these methods to run in high speed. One of the methods to increase the processing speed is to use the computer pa...
متن کاملKHATT: An open Arabic offline handwritten text database
A comprehensive Arabic handwritten text database is an essential resource for Arabic handwritten text recognition research. This is especially true due to the lack of such database for Arabic handwritten text. In this paper, we report our comprehensive Arabic offline Handwritten Text database (KHATT) consisting of 1000 handwritten forms written by 1000 distinct writers from different countries....
متن کامل